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CLAIMS 



What is claimed is: 



1 1. In a computer system comprising a server machine and a client machine, a text-to- 

2 speech synthesis method comprising: 

3 a) obtaining a normalized text; 

4 b) selecting acoustic units corresponding to said normalized text from a database 

5 accessible to said server machine, said database storing a predetermined 

6 number of possible acoustic units; 

7 c) transmitting compressed acoustic units from said server machine to said client 

8 machine, wherein said compressed acoustic units are obtained by compressing 

9 said selected acoustic units using a compression method selected in 

10 dependence on said predetermined number of possible acoustic units; and 

11 d) in said client machine, concatenating said selected acoustic units. 

1 2. The method of claim 1, further comprising generating prosody data 

2 corresponding to said normalized text and transmitting said prosody data from 

3 said server machine to said client machine, wherein step (d) comprises 

4 concatenating said selected acoustic units in dependence on said prosody data. 

1 3. The method of claim 1 wherein step (d) further comprises concatenating said 

2 selected acoustic units with at least one cached acoustic unit, wherein said 

3 cached acoustic unit is cached on said client machine. 

1 4. The method of claim 1, further comprising normalizing a standard text to 

2 obtain said normalized text. 

1 5. The method of claim 1 wherein said possible acoustic units are compressed 

2 possible acoustic units, and wherein said compressed acoustic units are 

3 compressed before being stored in said database. 
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6. The method of claim 1 wherein parameters of said compression method are 
selected to minimize the amount of data transmitted between said server 
machine and said client machine for each possible acoustic unit. 

7. The method of claim 6 wherein parameters of said compression method 
are further selected to achieve a minimum quality for each possible 
acoustic unit. 

8. The method of claim 1 wherein steps (c) and (d) are performed simultaneously 
for sequential acoustic units. 

In a server machine, a text-to-speech synthesis method comprising: 

a) obtaining a normalized text; 

b) selecting acoustic units corresponding to said normalized text from a database 
storing a predetermined number of possible acoustic units; and 

c) transmitting compressed acoustic units to a client machine, wherein said 
compressed acoustic units are obtained by compressing said selected acoustic 
units using a compression method selected in dependence on said 
predetermined number of possible acoustic units. 

10. The method of claim 9, further comprising generating prosody data 
corresponding to said normalized text and transmitting said prosody data to 
said client machine. 

11. The method of claim 9, further comprising normalizing a standard text to 
obtain said normalized text. 

12. The method of claim 9 wherein said possible acoustic units are compressed 
possible acoustic units, and wherein said compressed acoustic units are 
compressed before being stored in said database. 

13. The method of claim 9 wherein parameters of said compression method are 
selected to minimize the amount of data transmitted to said client machine for 
each possible acoustic unit. 
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14. The method of claim 13 wherein parameters of said compression 
method are further selected to achieve a minimum quality for each 
possible acoustic unit. 

In a client machine, a text-to-speech synthesis method comprising: 

a) receiving compressed acoustic units corresponding to a normalized text from a 
server machine, said compressed acoustic units being selected from a 
predetermined number of possible acoustic units and compressed using a 
compression method selected in dependence on said predetermined number of 
possible acoustic units; 

b) decompressing said compressed acoustic units to obtain decompressed acoustic 
units; and 

c) concatenating said decompressed acoustic units. 

16. The method of claim 15, further comprising receiving prosody data 
corresponding to said normalized text from said server machine, wherein step 
(c) comprises concatenating said decompressed acoustic units in dependence 
on said prosody data. 

17. The method of claim 15 wherein step (c) further comprises concatenating said 
decompressed acoustic units with at least one cached acoustic unit. 

18. The method of claim 15 further comprising, before step (a), transmitting a 
standard text corresponding to said normalized text to said server machine. 

19. The method of claim 15 further comprising, before step (a), normalizing a 
standard text to obtain a normalized text, and transmitting said normalized text 
to said server machine. 

20. The method of claim 15 wherein parameters of said compression method are 
selected to minimize the amount of data transmitted to said client machine for 
each possible acoustic unit. 
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21. The method of claim 20 wherein parameters of said compression 
method are further selected to achieve a minimum quality for each 
possible acoustic unit. 

22. The method of claim 15 wherein steps (a), (b), and (c) are performed 
simultaneously. 

A text-to-speech synthesis system comprising: 

a) a database of predetermined acoustic units; 

b) a server machine in communication with said database for selecting ones of 
said acoustic units corresponding to a normalized text and for generating 
prosody data corresponding to said normalized text; and 

c) a client machine in communication with said server machine for concatenating 
said selected acoustic units in dependence on said prosody data; 

wherein said server machine transmits compressed acoustic units to said client 
machine, and wherein said compressed acoustic units are obtained by compressing 
said selected acoustic units using a compression method selected in dependence on 
said predetermined acoustic units. 

24. The system of claim 23 wherein said client machine contains at least one 
cached acoustic unit. 

25. The system of claim 23 wherein said server machine normalizes a standard text 
to obtain said normalized text. 

26. The system of claim 23 wherein said client machine normalizes a standard text 
to obtain said normalized text and transmits said normalized text to said server 
machine. 

27. The system of claim 23 wherein said predetermined acoustic units in said 
database are compressed predetermined acoustic units. 
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28. The system of claim 20 wherein parameters of said compression method are 
selected to minimize the amount of data transmitted between said server 
machine and said client machine. 

29. The system of claim 28 wherein parameters of said compression 
method are further selected to achieve a minimum quality for each 
predetermined acoustic unit. 

A program storage device accessible by a server machine, tangibly embodying a 
program of instructions executable by said server machine to perform method steps 
for a text-to-speech synthesis method, said method steps comprising: 

a) obtaining a normalized text; 

b) selecting acoustic units corresponding to said normalized text from a database 
storing a predetermined number of possible acoustic units; and 

c) transmitting compressed acoustic units to a client machine, wherein said 
compressed acoustic units are obtained by compressing said selected acoustic 
units using a compression method selected in dependence on said 
predetermined number of possible acoustic units. 

3 1 . The device of claim 30 wherein said method steps further comprise generating 
prosody data corresponding to said normalized text and transmitting said 
prosody data to said client machine. 

32. The device of claim 30 wherein said method steps further comprise 
normalizing a standard text to obtain said normalized text. 

33. The device of claim 30 wherein said possible acoustic units are compressed 
possible acoustic units, and wherein said compressed acoustic units are 
compressed before being stored in said database. 

34. The device of claim 30 wherein parameters of said compression method are 
selected to minimize the amount of data transmitted to said client machine for 
each possible acoustic unit. 
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The device of claim 34 wherein parameters of said compression 
method are further selected to achieve a minimum quality for each 
possible acoustic unit 
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